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Abstract. We argue that the language of Zermelo Fraenkel set the- 
ory with definitions and partial functions provides the most promising 
bedrock semantics for communicating and sharing mathematical knowl- 
edge. We then describe a syntactic sugaring of that language that pro- 
vides a way of writing remarkably readable assertions without straying 
far from the set-theoretic semantics. We illustrate with some examples of 
formalized textbook definitions from elementary set theory and point-set 
topology. We also present statistics concerning the complexity of these 
definitions, under various complexity measures. 



1 Introduction 

With the growing use of digital means of storing, communicating, accessing, and 
manipulating mathematical knowledge, it becomes important to develop appro- 
priate formal languages for the representation of such knowledge. But the scope 
of "mathematical knowledge" is broad, and the meaning of the word "appropri- 
ate" will vary according to the application. At the extremes, there are competing 
desiderata: 

— At the foundational level, one wants a small and simple syntax, and a precise 
specification of its semantics. In particular, one wants a specification as to 
which inferences are valid. 

— At the human level, one wants to have mathematical languages that are as 
easy to read and understand as ordinary mathematical texts, yet also admit 
a precise interpretation to the foundational level. 

For ordinary working mathematicians, the foundational interpretation is largely 
irrelevant, but some sort of formal semantics is necessary if the information 
encoded in mathematical texts is to be used and manipulated at the formal level. 
Of course, one solution is simply to pair each informal mathematical assertion 
with a formal translation, but then there is the problem of obtaining the formal 
translations and ensuring that they match the intention of the informal text. As 
a result, it is more promising to use semi-structured languages that integrate 
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features of both the foundational and human levels. This results in a smooth 
spectrum of languages in between the two extremes. At intermediate "expert 
user" levels, one may want a language whose structure is close to that of the 
underlying foundational framework, yet is as humanly readable as possible. 

To complicate matters, there are features of mathematical knowledge that 
are not captured at the level of assertions: mathematical language is used to 
communicate definitions, theorems, proofs, algorithms, and problems, among 
other things. At the level of a mathematical theory, language is also used to 
communicate relationships between these different types of data. The formal 
information that is relevant will vary depending on the application one has in 
mind, be it database access and search, theorem proving, formal verification, etc. 

Here we will be primarily concerned with mathematical assertions as they 
are used to state definitions and theorems.^ If one is looking for a foundational 
framework that is robust enough to subsume those used by most systems of 
MKM, it is hard to beat the language of set theory: we know of no foundational 
system other than Quine's New Foundations that cannot be interpreted in the 
language of set theory in such a way that inferences arc reduced to inferences in 
Zermelo-Fraenkel set theory with the axiom of choice (ZFC), or some plausible 
extension (say, with large universes of sets). To be clear, we are not denying 
the importance of other frameworks for more specific purposes. For example, 
the theory of real closed fields is appropriate to representing many constraint 
problems, and constructive frameworks are better suited to certain forms of 
algorithmic reasoning. It is also important to find ways of sharing the additional 
information that comes with the use of these more restricted frameworks. We 
are simply singling out set theory as a unifying framework for expressing what 
assertions in the various local frameworks have in common. 

We extend the foundational framework in two ways. First, we allow for ex- 
plicit definitions of new predicates and functions on the universe of sets. And, 
second, we allow function symbols to denote functions that are only partially de- 
fined, using a logic of partial terms. We call the resulting formal system DZFC . 
As we observe in Section 2, this system is easily shown to be conservative over 
ZFC. We argue that these extensions are not just a matter of syntactic sugar, 
but, rather, arc essential to adequate representation of the mathematical data: 
there is a difference between assertions using defined terms and their expanded 
versions, and, in mathematical terms, 1/0 really is an undefined quantity. Thus 
DZFC is our proposal for a foundational language and its semantics. 

Our main goal here is to show that the distance between this foundational 
level and ordinary mathematical text is not as far as is commonly supposed, by 
presenting a syntactically-sugared version of set theory, PST, that is simulta- 

* In passing, we note that computational proof assistants like Mizar [13], HOL [7], 
Isabella [12], Coq [3] and HOL light [8] all provide languages that can be used 
to describe mathematical proofs. Of these, the Mizar and Isabelle/Isar languages 
model human proof languages most closely. The Isar effort [16] shows that the proof 
language is somewhat orthogonal to the assertion language; that is, Isar can be 
instantiated to various foundational frameworks, subject only to minor constraints. 



neously close to both. On the one hand, we show that our language is easily 
parsed and translated to DZFC. On the other hand, by automatically replacing 
symbolic expressions with user-provided natural language equivalents, wc obtain 
output that is humanly readable, and, although not exactly literary, recognizably 
faithful to the original mathematical texts. 

We support this last claim with examples from Suppes's Axiomatic set theory 
[14] and Munkres's Topology [11]. In each case, wc present our formal input with 
both DZFC and our natural language translations. Indeed, the appendices to 
Kieffer [9] provide a corpus of 341 definitions, taken from Chapters 2-6 of Suppes 
and Sections 12-38 of Munkres. Examples of the natural language translations 
can be found in Appendix B, below. These examples show that PST offers a 
promising target semantics for mathematical markup languages, like OMDoc 
[10]. 

To illustrate the utility of PST, we describe two pieces of software that take 
advantage of both the formal structure of the definitions and their proximity 
to the informal text. First, we describe statistical studies of the complexity of 
definitions in our corpus, measured in various ways. Our analysis shows, not 
surprisingly, that expanding definitions to the pure language of set theory yields 
formulas that are huge. Perhaps more surprisingly, quantifier complexity of def- 
initions remains remarkably low, even when they are expanded to DZFC. We 
also describe software that makes it possible to explore definitional dependen- 
cies, expanding and compressing nodes via a graphical interface. To be sure, 
data like this can be mined from contemporary formal verification efforts."'' But 
mathematical developments are often changed significantly in the process of for- 
malization; what distinguishes the data presented here is the extent to which it 
faithfully represents the informal texts it is supposed to model. 

Our "user- friendly" version of set theory is based on Friedman [6]; see also 
an earlier version in Friedman [5]. Most of the work described here, including 
the implementation of the parser, the entering of the data from Suppes's and 
Munkres's books, and associated software, constitute Kieffer's MS thesis [9], 
written under Avigad's supervision. The thesis and code described here, as well 
as additional samples of the natural language translations, can be found via 
Avigad's web page.^ 

2 ZFC with definitions and partial terms 

It is widely acknowledged that Zermelo-Fraenkel axiomatic set theory with the 
axiom of choice, ZFC, is robust enough to accommodate ordinary mathematical 
arguments in a straightforward way. The most notable exceptions arc category- 
theoretic arguments which rely on the existence of large universes with suitable 
closure properties; but these can be formalized in extensions of ZFC with suitable 
large c;ardinal axioms, or by restricting the closure properties of the universes in 
question. 

^ See, for example, the MPTP challenges, http://www.cs.miami.edu/~tptp/MPTPChallenge/ 
® Specifically, see http://www.andrew.cmu.edu/user/avigad/Papers/mkm/. 



In this section, we describe a conservative extension DZFC of ZFC. This 
theory incorporates two features that allow for a more direct and natural math- 
ematical modeling: 

— it accommodates partially defined functions, and hence undefined terms; and 

— it allows the introduction of new function and predicate symbols to stand 
for explicitly defined functions and predicates. 

We describe each of these extensions, in turn. 

To start with, DZFC is based on a free logic, with a special predicate E{t). 
This is usually written ti, and can be read "i is defined" or H denotes." The 
axioms governing the terms are presented as the "logic of partial terms" in 
Beeson [2], £■+ logic in Troesltra and Schwichtenberg [15]; see also the very 
helpful explanation and overview in Feferman [4]. The basic idea is that variables 
in the language range over objects in the intended domain (in our case, sets), 
but, as function symbols may denote partial functions, some terms fail to denote. 
So, for example, the axioms for universal instantiation are given by Va; (f{x) A 
t\, fit)- The basic relation symbols of ZFC, which wc take to be G and 
=, are assumed only to hold between terms that denote; thus we have axioms 
s G t ^ si Ati and s = t — > sj, A tj. Partial equality s ~ t is defined as usual by 
the axiom s t {s [ V tl ^ s = t) . 

Next, the syntax of ordinary set theory is extended to include definition de- 
scriptions, d la Russell. Formally, for each formula 'fi{x), the expression {Lx)(f{x) 
is a term whose free variables are just those of ip, other than x. These terms are 
governed by the axioms 

y = {Lx)(fi{x) Vz {(p{z) z = y). 

Thus in DZFC one can show that (t.T)(p(x) is defined if and only if there is a 
unique y satisfying ip{y), in which case, {ix)ip{x) is equal to that y. 

Finally, one is allowed to introduce new function symbols and relation sym- 
bols to abbreviate formulas and terms. That is, for each formula 'p{x, y), one can 
introduce a new function symbol /(y) with the axiom 

f{y) - {i'X)Lp{x,y), 

and for every formula ^(y) one can introduce a new relation symbol R(y) with 
the axiom 

R{y) ^il){y). 

It is not hard to show that adding the usual axioms of set theory to this frame- 
work yields a conservative extension: 

Theorem 1. DZFC is a conservative extension of ZFC . 

The proof amounts to an interpretation of partial functions and elimination 
of definitions that is by now standard; details can be found in [15,9]. Note, 
however, that the usual method of eliminating defined function symbols and 
relation symbols by replacing them by their definiens can result in an exponential 
increase in length. 



3 The language of practical set theory, PST 



We now describe a more flexible language, Practical set theory, or PST, designed 
by Friedman. This language has two key features: 

— The language incorporates a healthy amount of syntactic sugar, making it 
possible to express ordinary mathematical definitions and assertions in a 
natural way. 

— The language is easily and eflaciently translatable to DZFC. 

In this section wo describe some of the features of PST and the translation to 
DZFC. A full and precise specification of the PST and its DZFC semantics can 
be found in [9,6], where it was called the Language of Proofless Text, or LPT. 
The claims of naturality will be supported with examples in the next section 
and in Appendix B. 

The starting point for PST is the usual syntax of first-order logic. We adopt 
conventions to distinguish between variables, defined functions, and relations; 
application of a defined relation REL to terms ti, . . . ,tk is written with square 
brackets REL[ti, . . . ,tj,], while application of a defined function Fun is written 
with parentheses, Fun{t\, . . . ,tk). The usual language of first-order logic is aug- 
mented with a significant amount of "syntactic sugar," to make the expression 
of mathematical notions as convenient as possible. These include the following. 

Function application for sets. Any term may be used as though it were a function, 
of any arity (including "infix"). For example, one may quantify a variable /, and 
then proceed to use it as though it were a function. In PST, f{x) denotes the 
unique u such that the ordered pair {x,u) is in /, assuming there is such u. 
The following definition of the unary predicate FCN therefore asserts that / is a 
function if it is a set of ordered pairs {x, u) in which no x occurs more than once 
as the first component of a pair. 



DEFINITION FS.2.58: 1-ary relation FCN. FCM[/] ^ / = {{x,y) : f{x) = y}. 



Finite sets and tuples. In the previous example, we saw a finite tuple; namely, 
the ordered pair {x, y). Tuples of any finite length are terms in PST. 

A finite set can be denoted by simply listing all of its elements. For example, in 
defining the Wiener-Kuratowski ordered pair, we may use the term {{a}, {a, h}}. 

Set-builder notation. The example above illustrates the use of set-builder nota- 
tion. In PST, the term {t : Lp} denotes the set of all values of t{xi, . . . , Xn), where 
the variables xi, . .. ,Xn occurring in t range over tuples satisfying ip{xi, . . . ,x„). 
Note that this involves an essential use of partiality; for example, in the intended 
semantics, the term {x : x = x} is undefined. 

Suppose we wish to define Image(/) to be the set of all f{x) such that 
x S Dom(/). The expression 

Iinage(/) ~ {f{x) : x € Doin(/)} 



is not what we want, because / on the right-hand side is taken to be a bound 
variable ranging over the universe of sets. Instead, PST has us write 

Iinage(/) ~ {f{x) : x e Doin(/), / fixed} 

to indicate that the expression depends on a fixed vahic of /. 

Defined function symbols. We use an exclamation mark in place of Russell's t as 
a definite description operator. It is used in the next example, where we define 
an infix function, +q, for addition on the rational numbers. Every infix function 
is given a precedence number, for use in determining order of operations. 



DEFINITION FS.5.25: Infix function +q. a; +q y ~ {\z){x, y,zeq A(3a, b, c) 
{aGxAbGyAcGzAa +sub b = c)). Precedence 40. 



A definition may be composed of any number of "If ... then ..." clauses, and 
may end with one "Otherwise ..." clause, which allows definition by cases, as in 
the example' below. In this example the 'Otherwise' clause introduces a condition 
under which the function is undefined. For this we use the predicate t, and this 
allows for the definition of partial functions. 



DEFINITION FS.2.3: 1-ary function Dom. IiBR[R] thenDom(ii) ~ {x : {3y){x R y)} 
Otherwise Doiii(i?)|. 



Defined relation symbols. As with functions, we may define infix relations, as in 
the definition of < on the rational numbers, below. 



DEFINITION FS.5.24: Infix relation <q. x<qy ^ {3z,w){x,y e<Q Az ex A 
wGy A z <suB w). 



Lambda notation. PST includes a lambda operator which can be used to bind 
variables and thereby denote functions. In the example below, we define a binary 
function called Cartespow (for "Cartesian power"). This function maps a pair 
of sets A, B to the set A^; i.e., a product of i?-many copies of A. The definition 
relies on a previously defined function, Cartesprod (for "Cartesian product"), a 
binary function taking a map / and a set C to the product over c € C of the sets 
/(c). The definition of Cartespow uses lambda abstraction to define the constant 
function 6 i— > A on the fly, to serve as the first argument to Cartesprod. 



DEFINITION MunkTop.19.2.5: 2-ary function Cartespow. Cartespow(A, B) ~ 
Cartesprod((A6 G B){A), B). 



Infix relation chains. Infix relations may be chained together in the usual way, 
as with the <r relation in the example below. 



DEFINITION MunkTop.l3.3.a.basis: 0-ary function Stdrealtopbasis. 
Stdrealtopbasis ~ {J/CIR : {3a,be'R){U = {x€R: a<RX<Rb})}. 



Bounded quantifiers. Quantified variables and variables used in set-builder no- 
tation may be bounded by any infix relation, as in the example above. 

The translation from PST to DZFC is not difficult. Since our grammar for 
PST is not LL, we used the ACCENT compiler-compiler , which implements 
Earley's algorithm. The latter can parse any context-free grammar in cubic time, 
and runs in quadratic time when the grammar is unambiguous [1] . 

Appendix A contains a number of examples of PST definitions, together 
with their translations to DZFC. In each case, we present the DZFC input, a 
I^TJt;X representation of that input generated by the parser, and the translation 
to DZFC. A much larger corpus of examples — 183 definitions from Suppes's 
Axiomatic Set Theory [14] and 148 definitions from Munkres's Topology [11] — 
can be found in [9]. In practice, the translation took at most a few seconds to 
process a file containing a dozen large definitions. Comparing the (M^) DZFC 
output with the (I^T^jX version of the) PST input yields a factor of about 0.91, 
which is to say, the DZFC translations are actually slightly shorter. 

4 Natural language output 

The examples of PST input in the last section are readable, but not attractive. 
It is hard to remember meaning of symbols "BR" or "TOPSP"; it would help 
to have phrases like "is a binary relation" or "is a topological space." In fact, 
even for logical connectives like A, natural language equivalents like "and" are 
generally easier to read. In an ordinary mathematical language text, however, 
words are not always favored over symbols. For example, defined functions are 
usually given symbols: gcd{x,y) instead of "the greatest common divisor of x 
and y." Binary relations like = and < are usually preferred to "equal to" and 
"less than." On the other hand, unary relations often represent concepts that 
are expanded to words, as shown by the examples above. 

In light of these observations, we chose to output natural language equivalents 
for the connectives, and allow the user to input natural language equivalents for 
defined symbols. For example, with the entry 

TOPSP: 20 

reln:$(#~0,#"l)$ is a y.e?topological space°/oee?@ 
negn:$(#~0,#~l)$ is not a topological space® 
plur :°/o$(#"0,#"l)$y. are topological spaces® 
nplu:7.$(#"0, #"!)$% are not topological spaces®® 

http://accent.compilertools.net/ 



the user can specify the natural language that should be used in place of the 
TOPSP relation. 

In some cases, either symbols or a natural language equivalent can be used, 

as in {a; G N I . . .} or "the set of a; € N such that " It is usually awkward to 

have natural language occur as a subterm of a symbolic expression; for example, 
consider "1 + the greatest common divisor of x and y." Thus we incorporate a 
monotonicity rule: once a subterm of a term has been expanded to natural 
language, natural language versions arc favored from then on. This choice yields, 
for example, {x € N | a < a; < 6}, but also "the set of x in N such that a<x <h 
and X is even." 

Accordingly, the user supplies two clauses for a defined function or relation 
for which symbols are preferred over words: 

\wp : 1® 

symb:$\wp(#"0)$<a 

word: the power set of #0@@ 

whereas if words are the desired default then just one clause is needed: 

Stdrealtop:OQ 

word: the standard topology on $\mathbb{R}$@@ 

Appendix B provides examples of natural language output. We emphasize 
that these were generated directly from the PST input, using the additional 
natural language data, supplied by the user, described above. Although the defi- 
nitions are not exactly literary, they are surprisingly readable, and close to ordi- 
nary mathematical text. It is certainly the case that additional heuristics could 
be used to render the output more attractive, and additional markup from the 
user would result in improvements. In other words, there is a lot more that can 
be done along these lines; our claim here is only that PST offers an auspicious 
start. 

5 Exploring definitions 

Among the benefits of having a database of definitions is the ability to explore 
those definitions interactively. We designed two simple programs with which to 
demonstrate some of the possibilities. 

Our first program allows the interactive display and manipulation of directed 
acyclic graphs (dags) of conceptual dependencies, as depicted in Figure 1. 

With a second program we gathered statistics on these graphs. Associated 
to each definition is the dag of all definitions on which it depends; by the size of 
this dag we mean the number of vertices, and by the depth of this dag we mean 
the length of its longest directed path. Table 1 shows the maximum and mean 
values for all definitions in our database. 

Additional statistics, including data on the quantifier complexity of defini- 
tions in our corpus, can be found in Appendix C. 




Fig. 1. Exploring the definition dag for the Stone-Cech compactification. 



Table 1. Max and mean dag sizes and depths 







Max 


Mean 


All 


Depth 


32 


10.77 




Size 


110 


29.56 


Suppes 


Depth 


26 


10.09 




Size 


77 


25.91 


Munkres 


Depth 


32 


12.25 




Size 


110 


36.01 



6 Conclusions 



We have argued that one should adopt a language close to definitional set theory 
as a uniform language to support communication and exchange of mathematical 
results. The particular language we describe here, Practical set theory, fares well 
in that regard: it is easy and natural to work with, providing a high-degree of 
readability while remaining close to a clear foundational semantics. 

Appendix A: Examples of PST input and DZFC 
translations 

We consider a few examples of formal definitions, highlighting the natural- 
ity of PST over DZFC. (The tuq function appearing in the DZFC transla- 
tions is a function defined to take (a, b) to the Wiener-Kuratowski ordered pair 
{{a}, {a, 6}}.) 



Example 1. Here the description operator is used in PST to bind an ordered 
pair, so that we are able to refer to "the unique ordered pair (F, T') such that...." 
This translates to a much clumsier expression in DZFC, requiring two additional 
bound variables. 

PST input: 

DEFINITION MunkTop . 29 . 4 : 2-ary function Oneptcompactif ication. 
If TOPSP[X,T] then Oneptcompactif ication(X,T) \simeq 
(!<Y,T'>) ( 

COMPACTIFICATION[Y,T' ,X,T] \wedge Y \less X \approx_{C> 1_-CN} 

). 

PST rendered in &TeX: 

DEFINITION MunkTop.29.4: 2-ary function Oneptcompactif ication. If 

TOPSP[X,r] then Oneptcompactif ication(X, T) ~ (! {Y,T')) 
(COMPACTIFICATION[r, T', X, T] A Y\X «c In)- 

DZFC translation: 

Oneptcompactif ication(X,T) ~ (;,?;o)(TOPSP[X, T] A j/o - {lxq){3Y,T'){xo = 
zuo{Y, T') a (COMPACTIFICATION[r, T , X, T] A !==ic[\{Y, X), U]))) 



Example 2. Next observe what happens in DZFC, where we cannot match 
the brevity of expression used in our definition of the FCN[f] predicate in PST 
(which says that f is a function). 

PST input: 

DEFINITION FS.2.58: 1-ary relation FCN. FCN[f] \iff 
f = {<x,y> : f (x) = y}. 



PST rendered in BTeK: 

DEFINITION FS.2.58: 1-ary relation FCN. FCM[/] ^ / = {{x,y) : f(x) = y}. 
DZFC translation: 

FCN[/] ^ / = {tzo){yyo){yo ^ zq ^ {3x,y){yo = wo{x,y) A {{iXo){wo{x,XQ) € 

f) = y))) 

Example 3. Here we see how important the lambda operator is: 

PST input: 

DEFINITION MunkTop . 19 . 2 . 5 : 2-ary function Cartespow. Cartespow(A,B) 
\simeq CartesprodC (\lambda b \in B)(A),B). 

PST rendered in &TEpC: 

DEFINITION MunkTop. 19.2.5: 2-ary function Cartespow. Cartespow(A, B) 
Cartesprod((A6 G B){A), B). 

DZFC translation: 

Cartespow(A, B) ~ Cartesprod(((.2;o)(Vyo)(2/o & zq (36, a;o)(yo = ^^0(6, a;o) A 
xo = {A)hb&B)),B) 



Appendix B: Examples of the natural language translations 

In some cases our natural language generating program pst2nl produces output 
that is quite close to what a human being might write. For example, from the 

following PST input, 

DEFINITION MunkTop.13.2: 2-ary function Basisgentop. If TOPBASIS[^, X] 
thenBasisgentop(ii',X) ~ (!^ C p(X))((V[/ C G ^ ^ (Vx G C/)(3B G ^) 

(a; G S A B C U))). 

we get the following NL (natural language) output: 

Definition: If ^ is a basis for a topology on X then the topology on X 

generated by SS is the unique 3" C p{X) such that for every U X ,1} ^ 3" \i 
and only if for every x £ U, there exists B G ^ such that x G B and B C U. 



Indeed, this is not substantially different from the original text in Munkres 
[11], page 78. After defining what it means for to be a basis, Munkres says. 

If ^ satisfies these two conditions, then we define the topology !7 gener- 
ated by ^ as follows: A subset ?7 of X is said to be open in X (that is, 
to be an element of ^) if for each x E U, there is a basis clement i? G ^ 
such that X G B and B C U. Note that each basis element is itself an 
element of 



What is more common is that the output of pst2nl reads nicely except for 
a "run-on" sound, resulting from insufficient punctuation. For example: 

Definition: If i? is a strong simple order on X then the basis for the order 

topology on {X, R) is the set of U such that there exist a, b Cz X such that U = 
(a, b) or a is a first element in X and U = [a, b) or 6 is a last element in X and 
U = (a, 6]. 



In Munkres, page 84, all of this information is spread out over a numbered 

list: 

Definition. Let X be a set with a simple order relation; assume X has 
more than one element. Let ^ be the collection of all sets of the following 

types: 

1. All open intervals (a, b) in X. 

2. All intervals of the form [ao,b), where ao is the smallest element (if 

any) of X. 

3. All intervals of the form {a, bo], where bo is the largest element (if 
any) of X. 

The collection ^ is a. basis for a topology on X, which is called the order 
topology. 

Heuristics, combined with additional user markup, could eventually be incor- 
porated to help improve the flow and punctuation of the translations. We have 
implemented one easy improvement already, whereby adjacent assertions of a 
common predicate are combined into a single assertion using plural form. Thus, 
from the PST input, 

DEFINITION MunkTop.l2.4.a: 3-ary relation FINERTOP. If TOPSP[X, ^] A 
TOPSP[X, then FINERTOP[^', ^, X] ^ ^' D ^. 

we obtain: 

Definition: If {X, and {X, are topological spaces then ^' is finer 
than ^ on X if and only if 5" D £r. 



This time Munkres is able to make several definitions in a single paragraph, 
and can abbreviate a more complex logical locution with the phrase "respective 
situations." 

Definition. Suppose that .'7 and ,'7' are two topologies on a given set 
X. If D we say that ^' is finer than if ^' properly contains 
we say that ^' is strictly finer than ^. We also say that ^ is coarser 
than or strictly coarser, in these two respective situations. We say 
^ is comparable with ^' if either ^' D ^ or ^ D 



We consider a final example. 



DEFINITION MunkTop.l3.3.c: 0-ary function Krealtop. Krealtop ~ 
Basisgeiitop(StdrealtopbasisU{F CK : (3W G Stdrealtopbasis) 

{V = VK\{lnclFrR(lN/n) : nGN})},R). 

for which the NL output is as follows: 

Definition: The K-topology on R is the topology on R generated by the 
standard basis for a topology on R union the set of V C R such that there exists 
W in the standard basis for a topology on R such that V = W \ {1/n : n gN}. 



There are two sets mentioned in this definition: the set of V C R such that 
and the set of 1/n such that .... According to the "monotonicity rule" described 
in Section 4, the latter is rendered in symbols since it has no subterm in words; 
the former is rendered in words since its subterm, "the standard basis for a 
topology on R" has no symbolic form, and is displayed in words by default. 

Another feature of pst2nl is apparent in this last example, where the word 
"in" appears before "the standard basis...." We get this preposition rather than 
the incorrect phrase "is in," thanks to the final clause in the user-supplied natural 
language equivalents for the e relation: 

\in: inf ixQ 

symb:#0 $\in$ #10 

nsym:#0 $\not\in$ #1® 
reln:#0 is %e?±n%ee? #1Q 
negn:#0 is not in #1@ 
plur:y.#Oy. are in #10 
nplu:yo#Oyo are not in #19 
prep:#0 in #im 

Finally we note that the user is free to suppress artifacts of formalization, 
in the NL output. In the PST above there is an inclusion function InclprR, and 
the number 1 is subscripted as In. None of this shows up in the NL output. 

Comparison with Munkrcs, page 82, reveals that he is free to write in a less 
regimented form than that of our definitions: 

Finally, let K denote the set of all numbers of the form l/n, for n G 
and let ^" be the collection of all open intervals (a, 6), along with all 
sets of the form (a, b) — K. The topology generated by ^" will be called 
the K-topology on R. 



Appendix C: Data on quantifier complexity and length 

Our database of definitions entered in PST consists of 183 definitions from Sup- 
pes's Axiomatic Set Theory [14] and 148 definitions from Munkres's Topology 
[11]. 



Quantifier complexity data. For each definition in our database, we measured 
quantifier complexity in eight different ways. In the first place, we considered 
both alternating quantifier depth, and non-alternating. Secondly, we considered 
each definition in four different states: (1) as given in PST; (2) as translated 
into DZFC; (3) the expanded version of the DZFC, that is, with all definienda 
replaced by their definiens, recursively, until the process halts; and (4) a par- 
tially expanded version of the DZFC in which certain low-level, foundational 
definienda were left unexpanded, namely: the union, intersection, and set differ- 
ence operations, the ordered pair, and powerset functions, the empty set, and 
the subset and superset relations. The maximum and mean depths are presented 
in Table 2. 



Table 2. Max and irican quantifier depths 





Max 


Mean 


PST 


4 


0.66 


unexpanded DZFC 


5 


1.31 


fully expanded DZFC 


1235 


78.68 


partially expanded DZFC 


552 


38.54 


PST alternating 


3 


0.63 


unexpanded DZFC alternating 


5 


1.18 


fully expanded DZFC alternating 


422 


36.19 


partially expanded DZFC alternating 


239 


22.16 



It has been said that among actually occurring definitions in mathematics 
texts, the maximum alternating quantifier depth is three. Insofar as PST comes 
close to what actually occurs in textbooks, the maximum alternating depth of 3 
tends to confirm this conjecture. 

Note that the maximum depth after translating into DZFC goes up to 5. This 
reflects what wc saw in Appendix A, where a definition that used no quantifiers 
in PST turned out to require them after translation into DZFC. 

The max;imum depth of 1235 for a fully expanded definition confirms the 
necessity of using definitions to package information into manageable chunks. 
Meanwhile, the contrast between the total expansion maximum, and the partial 
expansion maximum of 552, demonstrates that the lowest, most foundational 
definitions, lend quite a bit of this complexity. 

The ratio 78.68/36.19 2.17 of the mean fully expanded depth to the mean 
fully expanded alternating depth suggests that quantifiers often occur in runs 
of two, before alternating, when definitions arc written in pure set theory. The 
somewhat lower ratio of 38.54/22.16 « 1.74 for the partially expanded cases 
indicates the extent to which the lowest-level concepts contribute to this doubling 
of consecutive quantifiers. 

The mean depth for PST alternating (again, what comes closest to what 
we ordinarily think of as quantifier depth in textbooks) shows that, while the 



maximum is three, the most common depths are and 1. The exact number of 
occurrences are presented in Table 3. 

Table 3. Quantifier depth frequencies in PST 





Occurrences 


Depth 


PST 


PST ahernating 





178 


178 


1 


118 


120 


2 


30 


35 


3 


14 


8 


4 


1 






Length data. As was expected, there is rapid blowup in the size of definitions 
when they arc expanded. In collecting our data we set a maximum of 2^"^ — 1 
before we stopped counting, and this maximum was often reached. 

In particular, since the development of the real numbers taken from Sup- 
pes [14] involves such deep definition trees, any definition mentioning the real 
numbers will have enormous expanded length. For example, the definition of the 
basis for the standard topology on the reals (see Section 3) is just 303 symbols 
long after initial translation into DZFC, but blows up to over 2^^ — 1 symbols 
after expansion. 

The longest definition we formalized from Suppes [14] was 526 symbols, and 
the longest from Munkres [11] was 714 symbols. 
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